A General Method for Multi Agent Reinforcement Learning in Unrestricted Environments
نویسندگان
چکیده
Previous approaches to multi agent reinforcement learning are either very limited or heuristic by na ture The main reason is each agent s environment continually changes because the other agents keep changing Traditional reinforcement learning algo rithms cannot properly deal with this This paper however introduces a novel general sound method for multiple reinforcement learning agents living a single life with limited computational resources in an unrestricted environment The method properly takes into account that whatever some agent learns at some point may a ect learning conditions for other agents or for itself at any later point It is based on an e cient stack based backtracking procedure called environment independent reinforcement accel eration EIRA which is guaranteed to make each agents learning history a history of performance im provements long term reinforcement accelerations The principles have been implemented in an illustra tive multi agent system where each agent is in fact just a connection in a fully recurrent reinforcement learning neural net
منابع مشابه
Voltage Coordination of FACTS Devices in Power Systems Using RL-Based Multi-Agent Systems
This paper describes how multi-agent system technology can be used as the underpinning platform for voltage control in power systems. In this study, some FACTS (flexible AC transmission systems) devices are properly designed to coordinate their decisions and actions in order to provide a coordinated secondary voltage control mechanism based on multi-agent theory. Each device here is modeled as ...
متن کاملAn Unsupervised Learning Method for an Attacker Agent in Robot Soccer Competitions Based on the Kohonen Neural Network
RoboCup competition as a great test-bed, has turned to a worldwide popular domains in recent years. The main object of such competitions is to deal with complex behavior of systems whichconsist of multiple autonomous agents. The rich experience of human soccer player can be used as a valuable reference for a robot soccer player. However, because of the differences between real and simulated soc...
متن کاملIncremental Self-improvement for Life-time Multi-agent Reinforcement Learning
Previous approaches to multi-agent reinforcement learning are either very limited or heuris-tic by nature. The main reason is: each agent's or \animat's" environment continually changes because the other learning animats keep changing. Traditional reinforcement learning algorithms cannot properly deal with this. Their convergence theorems require repeatable trials and strong (typically Markovia...
متن کاملImproving Agent Performance for Multi-Resource Negotiation Using Learning Automata and Case-Based Reasoning
In electronic commerce markets, agents often should acquire multiple resources to fulfil a high-level task. In order to attain such resources they need to compete with each other. In multi-agent environments, in which competition is involved, negotiation would be an interaction between agents in order to reach an agreement on resource allocation and to be coordinated with each other. In recent ...
متن کاملUtilizing Generalized Learning Automata for Finding Optimal Policies in MMDPs
Multi agent Markov decision processes (MMDPs), as the generalization of Markov decision processes to the multi agent case, have long been used for modeling multi agent system and are used as a suitable framework for Multi agent Reinforcement Learning. In this paper, a generalized learning automata based algorithm for finding optimal policies in MMDP is proposed. In the proposed algorithm, MMDP ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2002